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Abstract 


It  Is  proved  that  in  euclidean  n-space  the  maximum  M(p)  and 
minimum  m(p)  of  a  fixed  positive  definite  quadratic  polynomial  Q 
on  spheres  with  fixed  center  are  both  convex  functions  of  the  radius  p 
of  the  sphere.  In  the  proof,  which  uses  elementary  calculus  and  a 
result  of  Forsythe  ar.d  Golub,  m"(p)  and  M"(p)  are  shown  to  exist 
and  lie  in  the  interval  [2A  ,£>n]  ,  where  are  the  eigenvalues  of 

the  quadratic  form  of  Q  .  Hence  m"(p)  >  0  and  M"(p)  >  0  . 


Summary 

Let  A  be  a  given  symmetric,  nonsingular  matrix  of  real  elements 
and  order  n  .  Lee  b  be  a  given  column  vector  of  n  real  elements. 

For  each  real  column  n- vector  x  ,  the  nonhomogoncous  quadratic  polynomial 

Q,(x)  =  (x-b)11  A(x-b) 

(T  denotes  transpose)  is  a  real  number.  Let  ^  <  ...  be 

the  (necessarily)  real  eigenvalues  of  A  .  Let  m(p)  be  the  minimum  of 

T  2 

Q(x)  on  the  sphere  =  {x:  x  x  =  p  }  ,  and  let  M(p)  be  the  maximum 
of  Q(x)  on  .  M.  J.  D.  Powell  asked  the  author  whether  m(p)  is  a 
convex  function  of  p  when  A  is  positive  definite.  An  affirmative 
answer  is  given  by  the  theorem: 

(1)  Theorem.  If  A  is  positive  definite  (i.e.,  if  0  <  X  )  ,  then  both 
m(p)  and  M(p)  are  convex  functions  of  p  ,  for  all  p  >  0  . 

Theorem  (l)  will  follow  from  the  following  result: 

(2)  Theorem.  Let  A  be  any  nonsingular  matrix.  Then  for  p  >  0  , 
the  second  derivatives  m"(p)  and  M"(p)  both  exist,  and 

(5)  m"  (p)  >  2^  .and  M"(p)  >  2>^  . 

Equality  occurs  in  (j5 )  if  and  only  if  Ab  =  A^b  .  Moreover, 

(L)  m"  (p )  <  2An  and  M"(p)  <  2An 

and  equality  occurs  in  (L)  if  and  only  if  Ab  =  A^b  . 
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Review  ol‘  Previous  Work 


The  proof  of  Theorem  (2)  is  based  on  techniques  developed  in  Forsythe 
and  Golub  [l],  which  dealt  only  with  the  case  p  =  1  .  The  relevant 
results  of  [l]  are  now  summarized  and  extended  to  general  p  . 
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Letting  x  =■•  x^g  ,  we  find  from  (t>)  that 
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so  that  (6)  becomes 
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r(A)  =  £ 
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For  eacli  given  value  of  p  >  0  ,  equation  (8)  determines  from  2  to  2n 
real  values  of  A  .  For  each  A  so  determined,  equation  (5)  determines  one 
or  more  vectors  x^  (if  all  b^  0  ,  then  x^  is  unique).  For  any  , 

we  have 


(9) 


q(xA)  =  f (A)  , 


where 


c. 


Now  Q(x)  is  stationary  with  respect  to  at  any  .  For  given  p 

let  A^  =  A^(p)  and  =  AR(p)  be  the  smallest  resp.  largest  values 
of  A  satisfying  equation  (8).  Theorem  (4.1)  of  [l]  states  that  f(A  ) 
and  f(A^)  are  the  minimum  resp.  maximum  values  of  Q(x)  on  . 

Much  of  [1]  was  devoted  to  the  singular  cases  where  some  tu  =  0  . 

For  the  present  investigation,  where  we  are  interested  only  in  the 
values  of  Q(x)  ,  we  simply  omit  from  the  sums  (S)  and  (10)  all  terms 
with  b^  =  0  ,  and  reduce  n  ,  if  necessary.  Having  done  that,  it  is  then 
clear  from  (8)  that,  for  any  p  , 

(11)  Al  <  and  Xn  <  AR  . 

This  concludes  the  necessary  summary  of  [ l] . 

As  a  digression,  the  author  notes  that  the  main  theorems  (2.7 )  and 
(4.1)  of  [l]  were  proved  in  [  l]  by  studying  f(x)  and  g(x)  for  complex 
values  of  X  .  In  late  1965,  Professor  W.  Kahan  [unpublished]  showed  us 
how  to  prove  those  theorems  more  simply,  using  only  real  values  of  A  . 

Proof  of  Theorem  (2). 

With  the  above  apparatus  our  problem  is  reduced  to  an  exercise  in  the 
differential  calculus.  For  each  p  >  0  we  determine  a  unique  Lagrange 
multiplier  X  =  x(p)  from  (8)  —  eituer  the  minimal  A^  or  maximal  AR  , 

For  ease  of  exposition,  suppose  x(p)  =  A^  .  Then  the  function 


(12) 


m(p)  =  f(A(p)) 


is  determined  from  (10).  Gince  f(x)  and  g(x)  are  analytic  for 


X  <  X^  ,  the  function  m(p)  has  derivatives  of  all  order.  We  shall 
determine  m"(p)  by  calculus.  To  simplify  some  expressions,  we 
introduce  the  abbreviations 

n  X2b2 

(13)  Q_  «  £  -i-  -1  g  (p  -  2,  3,  4)  . 

P  i«l  (x.-x)2 

Differentiating  (10)  and  simplifying,  we  find: 

(H)  I  "  2^3  ; 

d2 

(15)  — \  -  2a  +  6xau 


Now  equation  (8)  states  that,  when  X  X(p)  , 


(16) 


a 


2  c  P 


Differentiating  (8)  twice  with  respect  to  p  yields 


(17) 


-  p  » 


(IS) 


+  3(§>  a. 


dp" 


1  . 


Solving  (17)  and  (18)  in  turn,  we  find 


(19) 


(20) 


dX  _ 

dp  Ct 
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Now,  by  the  chain  rule, 


and 


dm  df  .  dX 
dp  "  dX  dp 


(21) 


d2f  /d)u2  df  dfx 
:x2  V  dx  ‘  dp2 


We  now  substitute  into  (21)  the  expressions  (l4),  (].5),  ( 19) >  and  (20). 
We  find  that 

.2  2 

(22)  m"(p)  =  — ?  e  (2a  +  6X0!.  )  ^  +  2X0! 

dp  CC 


Hence 


§  m"  (p ) 


i  (Xa5  +  a2)  ,  by  (l6), 


Irrrplifying, 


i»-(P)  -  £-  E 


3  1*1  (^-x)5 


or 


(23) 


Formula  (23)  is  the  end  of  our  calculus  exercise.  In  it,  X  is 


* 

determined  from  solving  (8),  Note  by  (ll)  that  the  factors  (X^-Xr  all 
have  the  same  sign  for  i  ■  1,  2,...,  n  ,  whether  X  ■  or  X  -  Ap  . 
Hence  ^  m"(p)  is  a  weighted  average  with  positive  weights  of  the  {X^}  . 
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It  follows  that  ~  m"(p)  >  X^  ,  with  equality  only  when  all  X^  in  (2j) 
are  aqu-tl  to  X,  ,  i.e.,  if  b.  e  G  for  X.  >  X,  .  This  proves  (j), 
and  (4)  is  proved  analogously.  This  concludes  the  proof  of  Theorem  (2). 

It  would  be  desirable  to  have  a  simple  geometrical  proof. 

What  if  A  is  singular? 

If  A  is  singular,  that  is,  if  some  X^  =  0  ,  the  situation  is 
somewhat  more  complicated,  just  as  the  case  where  some  X^b^  =0  is 
complicated  in  [l].  Theorem  (2)  fails  to  hold  for  semidefinit.e  matrices, 
because  m"(p)  may  not  exist  for  some  p  ,  as  the  following  example  shows 

2  T 

(24)  Example .  For  n  =  2  let  Q(x)  s  (x2-l)  ,  where  x  =  (x^x^) 

Then 

°  <  p  <  1  , 

1  <  p  <  CD  f 

If  X^  *»  0  ,  the  Lagrange  multiplier  remains  at  X  *  0  for  all 
sufficiently  large  p  . 

Theorem  (l)  can  easily  be  extended  to  semidefinite  matrices  by 
continuity.  We  have 

(25)  Theorem.  If  A  is  positive  semidefinite  (i.e.,  if  0  <  X^  ), 
then  both  m(p)  and  M(p)  are  convex  functions  of  p  for  p  >  0 

In  proof,  we  note  that  m(p)  and  M(p)  are  continuous  functions  of 
the  elements  of  A  .  If  A  is  semidefinite,  it  can  be  approximated  by  ?. 
definite  matrix  A^  ,  for  which  and  are  convex,  with  ||A-A^||  <  t 

Letting  t  —  0  ,  we  find  that  m  ■  lim  m^  and  M  ■  lim  are  convex. 


i(p) 


so  m"(l)  does  not  exist. 
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